Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces
نویسندگان
چکیده
One of the drawbacks of standard reinforcement learning techniques is that they only operate when both the state and action spaces are finite. Q-learning is one such algorithm. We propose an extension of Q-learning to infinite state and action sets called CHAMPAGNE, using a simple “Local Expert” function approximation method. We then experimentally test the performance of the algorithm on several navigation tasks. The algorithm is able to successfully solve a T-maze in pyrobot after reasonable training. We present results from varying the input type, reinforcement delay, and maximum memory size for this algorithm.
منابع مشابه
Adaptive Agents for Artificial Life Domain
Adaptation and real time behavior becomes a necessity for agents that want to survive in Artificial Life environments. Agents are autonomous and therefore some form of unsupervised learning has to be used to improve and adapt their behavior in time. The neurodynamic reinforcement learning approach – Q-learning where Q-factors are represented using neural networks, overcomes problems of the clas...
متن کاملطراحی پایدارساز PSS3B بر اساس الگوریتم KH و Q-learning برای میراسازی نوسانات فرکانس پایین سیستم قدرت تکماشینه
The main purpose of this paper is to develop a supplementary signal using reinforcement learning (RL) to improve the performance of power system stabilizer (PSS). RL is one of the most important issues in the field of artificial intelligence and is the popular method for solving Markov decision procedure (MDP). In this paper, a control method is developed based on Q-learning and used to improve...
متن کاملMini/Micro-Grid Adaptive Voltage and Frequency Stability Enhancement Using Q-learning Mechanism
This paper develops an adaptive control method for controlling frequency and voltage of an islanded mini/micro grid (M/µG) using reinforcement learning method. Reinforcement learning (RL) is one of the branches of the machine learning, which is the main solution method of Markov decision process (MDPs). Among the several solution methods of RL, the Q-learning method is used for solving RL in th...
متن کاملKLearn: Stochastic Optimization Applied to Simulated Robot Actions Final Report
Machine learning techniques and algorithms are prevalent in robotics, and have been used for computer vision, grasping, and legged walking. Reinforcement learning approaches have been developed over the past 15 years, with modern techniques using continuous action spaces for various robotic applications. Policy gradient learning allows various optimization techniques to quickly optimize robotic...
متن کاملA Novel Robust Adaptive Trajectory Tracking in Robot Manipulators
In this paper, a novel adaptive sliding mode control for rigid robot manipulators is proposed. In the proposed system, since there may exist explicit unknown parameters and perturbations, a Lyapunov based approach is presented to increase system robustness, even in presence of arbitrarily large (but not infinite) discontinuous perturbations. To control and track the robot, a continuous controll...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008